An Analysis of Off-line and On-line Approaches in Urdu Character Recognition
Abstract
This article presents a detailed analysis of offline and online character recognition systems for Urdu script reported from 2002 to 2012. The analysis compares each system by methodology, text type, font, recognition level, sample, and accuracy achieved. The paper aims to cover the main aspects of offline and online character recognition, with special emphasis on Urdu script, to give readers broad exposure to the topic. In general, character recognition is the ability of a computer system to interpret printed or handwritten text from sources such as documents, books, reports, and photographs, or directly from digital touch screens. In offline character recognition, the input is an image of printed text captured by a scanner. In online character recognition, the text is captured in real time as it is written on a digital device such as a touch screen or digital pen.
Related Resources
The Optical Character Recognition for Cursive Script Using HMM: A Review
Automatic character recognition has a wide variety of applications, such as automatic postal mail sorting, number plate recognition, automatic form reading, and text entry on PDAs. Automatic recognition of cursive scripts is a complex process that faces issues not found in other scripts. Many solutions have been proposed in the literature to address the complexities of cursive scripts...
Line and Ligature Segmentation in Printed Urdu Document Images
This paper presents a technique for segmentation of printed Urdu text images into lines and ligatures, a key pre-processing step in Urdu Optical Character Recognition (OCR) systems. Unlike classical projection profile based line segmentation methods, the proposed scheme successfully segments overlapping and touching lines. Once the lines are segmented, ligatures are extracted from each text lin...
A New Large Urdu Database for Off-Line Handwriting Recognition
A new large Urdu handwriting database was designed at the Centre for Pattern Recognition and Machine Intelligence (CENPARMI). It includes isolated digits, numeral strings with and without decimal points, five special symbols, 44 isolated characters, 57 Urdu words (mostly finance-related), and Urdu dates in different patterns. It is the first database for Urdu off-line handwriting recognition. It ...
Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
Automatic recognition of printed texts and manuscripts facilitates data entry and digitization and is a considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
UCOM offline dataset-an urdu handwritten dataset generation
A benchmark database for character recognition is essential for efficient and robust development. Unfortunately, there is no comprehensive handwritten dataset for the Urdu language that can be used to compare state-of-the-art techniques in the field of optical character recognition. In this paper, we present a new and publicly available dataset comprising 600 pages of handwritten Ur...